SUMMARY

Vocal load plays a significant role in the aetiology of voice disorders and influences the response to treatment. For this reason, many
researchers have focused their attention on how the voice is used, especially when vocal load is increased, for instance during working hours.
The majority of studies in this regard have been performed by recording vocal parameters for brief periods with the aid of microphones. The
first devices recorded only a few parameters and for relatively short periods of time, and since microphones were used there were
problems with both privacy and background noise, such as the inclusion of voices from nearby people. Recently, microprocessors that can
monitor the voice for an entire day have been developed; these use miniaturised accelerometers as vocal sensors. The latest commercially
available version is the Ambulatory Phonation Monitor (APM) (KayPENTAX, Lincoln Park, NJ, USA), which can record several vocal
parameters for over 18 hours and supply a series of graphic representations of the variations in these parameters during the recording period.
In particular, the APM records vocal load by measuring the cycle dose and distance dose, and evaluates vocal intensity (dB sound
pressure level [SPL]), fundamental frequency and total phonation time. This report describes the APM, the use of an accelerometer as a
vocal sensor, the importance of its calibration and the parameters it records. In particular, details are given on phonation time, variations
in frequency, vocal intensity, phonation density and vocal dose. The role of the APM in voice studies is also described, in addition to its
potential clinical applications as demonstrated by the few reports available in the literature. We also discuss our experience with the device
in groups of euphonic and dysphonic elementary school teachers.

KEY WORDS: Ambulatory Phonation Monitor • APM • Vocal load • Vocal doses • Total phonation time • Parameters of voice 

Introduction

The majority of investigators maintain that vocal load plays a significant role in causing voice disorders and influences the response to treatment. For this reason, clinicians and researchers have focused their attention on how the voice is used. Previously published studies on voice usage as a risk factor for dysphonia relied on brief recordings and/or evaluations. As a result, there is little objective information on the real use of the voice throughout the day and, in particular, during working hours. The need for quantitative information on the use and abuse of the voice has led to the development of devices that can monitor voice function over a period of several hours [1, 2]. The first devices were of limited clinical use: they recorded only a small number of vocal parameters, which were not reliable, and the instruments were invasive and/or difficult to carry around for an entire day. In particular, the use of microphones raised privacy concerns, and the devices were sensitive to surrounding noise and to the voices of nearby people. The recent development of microprocessors designed for ambulatory monitoring of the voice, and the introduction of miniaturised accelerometers for detecting vocal parameters, have helped to overcome many of these limitations. When the accelerometer is placed at the jugular level, it captures the vibrations of the skin, and therefore their acceleration, indirectly supplying a reliable estimate of vocal parameters such as duration of phonation, fundamental frequency and vocal intensity; the latter is expressed as sound pressure level, and 'vocal doses' can also be derived [3-5]. Recently, a new device has become commercially available, namely the Ambulatory Phonation Monitor (APM; KayPENTAX, Lincoln Park, NJ, USA). With the APM it is possible to study, identify and, by means of biofeedback, improve habitual voice patterns (pitch, loudness, amount of voice use). The aim of this study is to describe this new device, the importance of its calibration, the parameters that it records and its potential applications, on the basis of both the limited existing data and our brief experience with the device in groups of euphonic and dysphonic elementary school teachers.

Clinical techniques and technology: 
Ambulatory Phonation Monitor (APM) 

The APM is a portable device that objectively documents voice use during an entire day. Specifically, the APM measures the amount of time a subject has phonated and when the phonation occurred, and estimates the subject's vocal intensity (dB SPL) and fundamental frequency (F0) during all phonation activity. The data essentially provide a 'profile' of a subject's 'typical' phonatory behaviour during the period of monitoring. The APM can also provide immediate, real-time vibrotactile feedback to the patient during daily activities, based on settings entered by the clinician prior to use. The rationale for this feedback is to help the patient establish, and become accustomed to, new phonatory behaviours outside a clinical environment. Pre-set vibrotactile feedback helps the patient to follow the recommendations of the speech therapist and makes control of frequency and intensity automatic; these factors might play an important role in causing voice disorders.
The APM uses a miniaturised accelerometer as a voice sensor, mounted on a silicone pad and attached to the neck at the jugular level with surgical adhesive. Svec et al. (2005) demonstrated that the accelerometer can indicate the mean SPL of soft, comfortable or loud voices with an accuracy better than ±2.8 dB in 95% of cases, provided that it has previously been calibrated for the individual subject [5]. Later, Hillman et al. (2006) demonstrated that, for fundamental frequency, vocal intensity (SPL) and phonation time, the accelerometer supplies data that are essentially superimposable on those captured with a traditional microphone, both in normophonic subjects and in individuals with mild or severe dysphonia [6].
In order to obtain a reliable evaluation of vocal intensity, the sensor must be calibrated for each patient; the signal detected by the accelerometer supplies valid sound pressure level data only if calibration has been performed. The subject is placed in front of the calibration microphone at a distance of 15 cm. Participants should be instructed to take a deep breath and sustain the vowel /a/, starting with a soft voice and steadily increasing the volume until they reach the loudest voice they can produce. Patients who cannot sustain phonation long enough to cover the soft-to-loud range in one breath should be instructed to produce the vowel /a/ for 1 or 2 seconds with a soft, a medium and a loud voice, inhaling between productions. Published data report an average speaking SPL range of 35 dB during calibration [7]; however, the exact range varies from one subject to another. As soon as the subject starts to phonate, calibration data points appear on the display, along with a straight red line that represents the best linear fit between the sound pressure levels recorded by the microphone and the amplitude of the signal captured by the accelerometer on the patient's neck. The software traces the red line once at least 7 data points have appeared, but this does not mean that the calibration has been performed well. The software reports an error if the best-fit line drawn through the available points is not statistically valid, but it does not check the range reached by the patient. Therefore, the examiner must invite the patient to continue phonating until his/her full amplitude range has been covered.

The APM can record many parameters, which are described below.
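As an illustration of the calibration fit described above, the following Python sketch fits a least-squares line between accelerometer level and microphone SPL and reports the SPL range covered. It is not the APM software: the function names, the sample values and the `min_range_db` threshold are assumptions made for this example; only the 7-point minimum comes from the text.

```python
import math

def fit_calibration(acc_levels_db, mic_spl_db, min_points=7, min_range_db=20.0):
    """Least-squares line mapping accelerometer level (dB) to microphone SPL (dB).

    min_range_db is a hypothetical threshold used only to flag a sweep
    that did not cover much of the soft-to-loud range.
    """
    n = len(acc_levels_db)
    if n != len(mic_spl_db):
        raise ValueError("both series must have the same length")
    if n < min_points:
        raise ValueError(f"need at least {min_points} calibration points, got {n}")
    mean_x = sum(acc_levels_db) / n
    mean_y = sum(mic_spl_db) / n
    sxx = sum((x - mean_x) ** 2 for x in acc_levels_db)
    syy = sum((y - mean_y) ** 2 for y in mic_spl_db)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(acc_levels_db, mic_spl_db))
    if sxx == 0 or syy == 0:
        raise ValueError("calibration points show no variation")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)          # Pearson correlation of the fit
    spl_range = max(mic_spl_db) - min(mic_spl_db)
    return {
        "slope": slope,
        "intercept": intercept,
        "r": r,
        "spl_range_db": spl_range,
        "range_ok": spl_range >= min_range_db,
    }

def estimate_spl(acc_level_db, cal):
    """Apply the fitted line to a new accelerometer reading."""
    return cal["slope"] * acc_level_db + cal["intercept"]

# Hypothetical soft-to-loud sweep (values invented for the example).
acc = [40.0, 45.0, 50.0, 55.0, 60.0, 65.0, 70.0, 75.0]
spl = [60.0, 64.0, 68.0, 72.0, 76.0, 80.0, 84.0, 88.0]
cal = fit_calibration(acc, spl)
print(cal["range_ok"], round(estimate_spl(50.0, cal), 1))
```

Reporting the covered range separately from the quality of the fit mirrors the point made above: a statistically valid line can still come from a sweep that was too narrow.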

Phonation time 

Phonation time is the total duration of phonation, expressed both as an absolute time (hours, minutes and seconds) and as the percentage of the recording period spent phonating. The calculation therefore excludes pauses for breathing and all moments when the patient is not phonating. With vocal dosimeters it is possible to evaluate the percentage of phonation time during working hours and to compare it with the percentages for non-working time and weekends. The APM also gives a graphic view of the phonation time, the so-called Phonation Time Profile Graph (Fig. 1a), which is a time-history display of the entire recording period. Time is displayed on the horizontal axis and the percentage of time spent phonating is shown in green; the percent phonation scale (% values) is displayed on the vertical axis at the left edge of the screen, and average amplitude is represented by a black line, with the associated dB SPL units displayed vertically at the right edge of the screen. The Phonation Time Profile uses a moving average to display the data; five minutes is the default interval over which data are averaged for display. The user can select subintervals of time to be viewed on the Phonation Time Profile Graph (Fig. 1b).
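The phonation time and its moving-average profile can be sketched as follows, assuming the recording has already been reduced to one voiced/unvoiced flag per analysis frame. The frame length and the function names are assumptions made for this example; the 5-minute default window is the one mentioned above.

```python
def phonation_time(voiced, frame_s):
    """Return (voiced seconds, percent of recording spent phonating)."""
    n_voiced = sum(1 for v in voiced if v)
    percent = 100.0 * n_voiced / len(voiced) if voiced else 0.0
    return n_voiced * frame_s, percent

def phonation_profile(voiced, frame_s, window_s=300.0):
    """Percent phonation over a trailing moving window, one value per frame."""
    win = max(1, round(window_s / frame_s))
    profile = []
    running = 0
    for i, v in enumerate(voiced):
        running += 1 if v else 0
        if i >= win:
            # Drop the frame that just left the window.
            running -= 1 if voiced[i - win] else 0
        n = min(i + 1, win)          # shorter window at the start of the recording
        profile.append(100.0 * running / n)
    return profile

# Hypothetical 5-second recording with 0.5 s frames and a 2 s window.
voiced = [True, True, True, False, False, False, False, False, False, False]
seconds, percent = phonation_time(voiced, frame_s=0.5)
profile = phonation_profile(voiced, frame_s=0.5, window_s=2.0)
print(seconds, percent, profile[3], profile[4])
```

Restricting `voiced` to the frames recorded during working hours, or during a weekend, gives the comparison of percentages described in the text.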

Phonation Density Graph 

The Phonation Density Graph is a three-dimensional graph that shows the amount of phonation (total duration) as a function of SPL (x-axis) and F0 (y-axis). The colour scale on the right of the graph (z-axis) indicates the density of phonation at a given fundamental frequency and amplitude. The more often a given pair of SPL and F0 values is used, the deeper the colour at the intersection of those intensity and frequency values. The more circumscribed the area, the higher the colour value, especially if the patient has used a limited set of paired frequency and intensity values, keeping the manner of voice usage constant (Fig. 2a). Widespread areas of high density mean that the subject has used extremely variable fundamental frequency and intensity values during the recording period (Fig. 2b). Euphonic subjects can generally keep F0 and SPL constant when they use their voice in different circumstances (during work, during breaks from work, after work and during weekends). In contrast, dysphonic patients cannot keep F0 and SPL constant, and their graphs show widespread areas of high density.
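The idea behind the density graph can be sketched as a two-dimensional histogram that accumulates voiced time per (SPL, F0) cell. The bin widths, the frame length and the sample values below are assumptions made for this example, not APM settings.

```python
from collections import Counter

def phonation_density(frames, frame_s, spl_bin_db=3.0, f0_bin_hz=20.0):
    """Accumulate voiced time (s) per cell, keyed by the cell's lower edges."""
    density = Counter()
    for spl_db, f0_hz in frames:
        cell = (
            (spl_db // spl_bin_db) * spl_bin_db,   # lower SPL edge of the cell
            (f0_hz // f0_bin_hz) * f0_bin_hz,      # lower F0 edge of the cell
        )
        density[cell] += frame_s
    return density

def peak_share(density):
    """Fraction of all voiced time that falls in the single densest cell."""
    total = sum(density.values())
    return max(density.values()) / total if total else 0.0

# Hypothetical voiced frames as (SPL in dB, F0 in Hz), 0.5 s each.
frames = [(75.4, 217.1), (76.0, 218.0), (75.0, 219.9), (82.7, 227.1)]
density = phonation_density(frames, frame_s=0.5)
print(density.most_common(1), peak_share(density))
```

A high `peak_share` corresponds to the circumscribed pattern described for euphonic subjects, while a low value corresponds to voiced time spread over many cells.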

Sound pressure level and fundamental frequency histogram

These graphs show the distribution of SPL and F0 values, respectively, as a percentage of total voicing. The highest column in each graph corresponds to the vocal intensity or F0 value used most during the recording period. If there are many columns of similar height, the subject has used different vocal intensities and F0 values in the period shown (Fig. 3). As with the phonation density graph, this pattern might reflect the subject's inability to keep SPL and F0 constant under different phonation conditions.

Fig. 1. Phonation Time Profile Graph. Phonation time profile of an entire recording period (10:03 a.m. - 10:03 p.m.) (A) and of a selected subinterval of time (2:30-4:00 p.m.) during the same exam (B).

Fig. 2. Phonation Density Graph. Normophonic subject: the area is circumscribed (limited, paired F0 and SPL values, keeping the manner of voice usage constant) (A). Patient with dysfunctional dysphonia: there are widespread areas of high density (the subject has used extremely variable F0 and SPL values during the recording period) (B).

Fundamental frequency/SPL density scatter plots

These graphs show the distribution of F0 and SPL, respectively, over the recording period. In other words, they show the variation in F0 and vocal intensity over time during recording (red line) (Fig. 4).

Vocal dose (vocal load)

Vocal loading is a combination of prolonged voice use and additional loading factors (background noise, acoustics, air quality) that affect the fundamental frequency, the type and loudness of phonation and the vibratory characteristics of the vocal folds [8]. Therefore, one of the most fundamental issues when studying the effects of long-term vocalization is to determine the proper way of quantifying the amount of voicing. For this purpose, 'vocal doses' have been introduced over the past 10 years as an expression of vocal load [9]. The term 'dose' in this case refers to the effect of long-term exposure of the vocal fold mucosa to vibrations. Vocal load and the related 'vocal doses' can be expressed with three different parameters: time dose ($D_t$), cycle dose ($D_c$) and distance dose ($D_d$).
Time dose ($D_t$) is the same as voicing time and measures the total time the vocal folds have spent vibrating. It is calculated as follows:

$$D_t = \int_0^{t_p} k_v \, dt \quad \text{(seconds)}$$

where $t_p$ is the total data collection time in seconds and $k_v$ is the voicing unit step function [9, 10].

The cycle dose ($D_c$) measures the number of oscillations of the vocal folds during the recording period; it depends both on F0 and on the total phonation time, and is calculated as follows:

$$D_c = \int_0^{t_p} k_v F_0 \, dt \quad \text{(cycles)}$$

where $t_p$ is the total data collection time in seconds, $k_v$ is the voicing unit step function and $F_0$ is the fundamental frequency in Hertz [9, 10].

The distance dose ($D_d$) is the total distance accumulated by the vocal folds in a cyclic path during vibration; it depends not only on the total phonation time and F0, but also on the amplitude of the vocal fold vibration and, therefore, on the vocal intensity (dB SPL). It is calculated as follows:

$$D_d = 4 \int_0^{t_p} k_v A F_0 \, dt \quad \text{(metres)}$$

where $A$ is the amplitude of vocal fold vibration.

The distance dose is particularly interesting because it could allow safe limits to be calculated, analogous to those used in industry for assessing vibrations transmitted to the hands of operators. The safe limit for hand tissues is approximately 520 metres of accumulated distance, and any exposure to vibration that exceeds this limit is considered a risk factor. Likewise, in the future it may be possible to establish safe limits for vocal folds exposed to vibrations, in order to reduce the risk of voice disorders [4, 8, 10].

Fig. 3. Sound Pressure Level and Fundamental Frequency Histogram. The highest SPL or F0 column in the graph corresponds to the values of vocal intensity and fundamental frequency used most during the recording period. Normophonic subject: only two columns are similar in height (limited F0 and SPL values, keeping the manner of voice usage constant) (A, B). Patient with dysfunctional dysphonia: several columns of similar height indicate that the subject has used different vocal intensities and fundamental frequencies in the period shown (C, D).

Fig. 4. Fundamental Frequency/SPL Density Scatter Plots. These graphs show the distribution of F0 and SPL, respectively, over time (line). The graphs also show the frequency density and the sound pressure level density, coloured in grey and superimposed over the mean values that can be seen in the tracing. Normophonic subject: F0 and SPL do not vary significantly over the recording time (A, B). Patient with dysfunctional dysphonia: a progressive increase in F0 and a concurrent progressive decrease in SPL is apparent (C, D).
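The three dose integrals defined above can be approximated by sums over fixed-length analysis frames. The sketch below assumes each frame carries a voicing flag, an F0 estimate and a vibration amplitude in metres. How the amplitude is obtained is outside the scope of this example, and all names and values are assumptions made for illustration.

```python
def vocal_doses(frames, frame_s):
    """Discrete approximation of time, cycle and distance dose.

    frames: iterable of (voiced, f0_hz, amplitude_m) tuples.
    Returns (D_t in seconds, D_c in cycles, D_d in metres).
    """
    d_t = d_c = d_d = 0.0
    for voiced, f0_hz, amplitude_m in frames:
        if not voiced:
            continue                      # k_v = 0: frame contributes nothing
        d_t += frame_s                    # k_v * dt
        d_c += f0_hz * frame_s            # k_v * F0 * dt
        d_d += 4.0 * amplitude_m * f0_hz * frame_s   # 4 * k_v * A * F0 * dt
    return d_t, d_c, d_d

# Hypothetical frames of 0.5 s each; amplitude of 1 mm is an invented value.
frames = [
    (True, 200.0, 0.001),
    (True, 200.0, 0.001),
    (False, 0.0, 0.0),
    (True, 250.0, 0.001),
]
d_t, d_c, d_d = vocal_doses(frames, frame_s=0.5)
print(d_t, d_c, round(d_d, 3))
```

The factor of 4 reflects the fact that, in each cycle, a point on the vocal fold travels four times the amplitude (out, back, out on the other side and back again).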

We describe below our personal experience with the APM. In our Unit, we studied 10 female elementary school teachers (mean age: 42.4 ± 7.2 yrs). Five teachers (mean age: 42.0 ± 5.2 yrs) had a negative history of voice disorders, and phoniatric and laryngostroboscopic examinations showed neither laryngeal dysfunction nor any signs of vocal fold disorders; the other five teachers (mean age: 42.8 ± 8.8 yrs) had vocal fold nodules. The participants were monitored for one week using the KayPENTAX Ambulatory Phonation Monitor, Model 3200. The APM was calibrated for each participant using the procedure described above, and each participant was monitored for the entire working day.

During their five-hour working day, the five healthy teachers showed progressively increasing F0 (mode and average) and intensity values, and this increase became statistically significant as they reached their fourth hour of teaching. In contrast, all the teachers with vocal nodules showed a progressive decrease in intensity, F0 mode and F0 average, which became statistically significant for all three parameters as they reached the fifth hour. These results are summarized in Table I. The percentage of phonation time did not change significantly in either subgroup. The distance dose was slightly lower in the group with vocal nodules, but the difference did not reach statistical significance.

Table I. Variation of F0 mode, F0 average and sound pressure level (mean ± SD) during a five-hour working day in healthy teachers and in teachers with vocal nodules.

Parameter          First hour     Second hour    Third hour     Fourth hour     Fifth hour

Healthy teachers
F0 mode (Hz)       217.1 ± 4.8    217.2 ± 4.3    218.7 ± 4.1    225.2 ± 1.9*    227.1 ± 2.1†
F0 average (Hz)    220.2 ± 1.4    220.9 ± 1.3    222.4 ± 1.9    227.0 ± 3.1*    228.8 ± 2.5†
SPL (dB)           75.4 ± 2.6     75.9 ± 1.9     76.8 ± 2.0     80.9 ± 2.0*     82.7 ± 1.6†

Teachers with vocal nodules
F0 mode (Hz)       218.3 ± 4.7    217.6 ± 4.6    216.2 ± 5.7    214.2 ± 5.7     205.9 ± 5.4‡
F0 average (Hz)    220.9 ± 6.3    219.2 ± 4.8    217.7 ± 5.4    215.6 ± 5.2     207.3 ± 6.1‡
SPL (dB)           73.3 ± 2.6     72.1 ± 2.2     71.3 ± 2.2     69.4 ± 2.4      65.7 ± 1.2§

* Fourth hour versus first, second and third hour: p < 0.02; † fifth hour versus first, second and third hour: p < 0.01; ‡ fifth hour versus first, second, third and fourth hour: p < 0.05; § fifth hour versus first, second, third and fourth hour: p = 0.01.

Discussion

The APM is a portable device that objectively documents voice use during an entire day of normal activity. Specifically, the APM measures the amount of time a subject has phonated and when the phonation occurred, and estimates the subject's vocal intensity and fundamental frequency during all phonation activity. These data essentially provide a 'profile' of a subject's 'typical' phonatory behaviour during the period of monitoring. The potential clinical applications of the APM include the study of vocal load in singers, the use of vocal dosimetry in forensic medicine, real-time biofeedback studies of the voice, and the study of voice parameters and vocal load in teachers.

Carroll et al. (2006), in their evaluation of vocal load in professional and semi-professional classical singers, suggested that dosimetry appears to be an effective tool for collecting data on prolonged use of the voice and for accurately evaluating vocal load, even in singers [11]. Since the APM is easy to wear even under stage costumes (in the case of classical music singers), it is a valid device for understanding voice usage during artistic performances. Moreover, for singers of all types of music, a 'numeric' calculation of vocal doses through analysis of $D_c$ and $D_d$ might help to establish the limits that vocal load must not exceed, so that singers can schedule periods of vocal rest and consequently avoid organic damage to the vocal cords.

The APM makes it possible to monitor voice usage during working hours. The ability to quantify vocal load through the cycle dose and distance dose, and to obtain parameters such as mean intensity, mean fundamental frequency, variations of F0 and SPL over time, total duration of phonation and phonation density (studied over both the total recording time and working hours alone), may help to solve many of the problems mentioned above and may supply data for an objective evaluation of dysphonia, even in the field of forensic medicine.

The APM can provide real-time vibrotactile feedback that is useful both in the rehabilitation of dysfunctional dysphonia and in establishing adequate post-surgical rehabilitation [12]. The vibrotactile device associated with the vocal dosimeter is a small box connected to the APM and attached to the patient's belt or to the strap of the bag that contains the APM itself. Both the APM and the vibrotactile device are very light and, even when worn together, do not hinder normal activities or work during recording. The device begins to vibrate when the patient's voice exceeds the sound pressure and/or fundamental frequency levels previously set by the clinician. Alternatively, the instrument can be set so that the vibration starts when vocal intensity and/or fundamental frequency drop below the pre-set values. The rationale for this biofeedback is to help the patient establish and become accustomed to new phonatory behaviours outside a clinical environment (e.g. speaking at a softer or louder amplitude). Furthermore, if one of the main causes associated with the dysphonia is excessive vocal load, being warned by a vibration when the vocal intensity and fundamental frequency limits are exceeded (particularly during working hours) allows the subject to modify the use of his/her voice and, consequently, to reduce both overload and abuse of the laryngeal apparatus.
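The threshold logic described above can be sketched as a simple per-frame check against clinician-set limits. This is only an illustration of the rule, not the device's firmware: the class name, field names and sample limits are assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class FeedbackLimits:
    """Clinician-set limits; a value of None means that limit is not used."""
    spl_max_db: Optional[float] = None
    spl_min_db: Optional[float] = None
    f0_max_hz: Optional[float] = None
    f0_min_hz: Optional[float] = None

def should_vibrate(spl_db, f0_hz, limits):
    """Return True if a voiced frame violates any configured limit."""
    if limits.spl_max_db is not None and spl_db > limits.spl_max_db:
        return True
    if limits.spl_min_db is not None and spl_db < limits.spl_min_db:
        return True
    if limits.f0_max_hz is not None and f0_hz > limits.f0_max_hz:
        return True
    if limits.f0_min_hz is not None and f0_hz < limits.f0_min_hz:
        return True
    return False

# Hypothetical settings: warn above 80 dB SPL, or below 210 Hz.
too_loud = FeedbackLimits(spl_max_db=80.0)
too_low = FeedbackLimits(f0_min_hz=210.0)
print(should_vibrate(82.7, 227.1, too_loud))   # loud frame against an upper SPL limit
print(should_vibrate(75.4, 217.1, too_loud))   # comfortable frame, same limit
print(should_vibrate(65.7, 205.9, too_low))    # low-pitched frame against a lower F0 limit
```

Keeping upper and lower limits independent reflects the two modes mentioned in the text: a warning when values are exceeded, or a warning when they drop below the pre-set level.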

It is important to define safe limits for the vibration dose and phonation time in order to avoid damage to the phonatory organs in certain categories at risk, such as teachers [8, 13], and the device has the potential to measure voice parameters objectively even while the subject is teaching. There are few publications in the literature on the use of the APM for studying voice and vocal load in teachers. Recently, using the APM, Morrow and Connor showed that in elementary school music teachers the use of electric voice amplification significantly reduces mean vocal intensity, phonation time, cycle dose and distance dose [14, 15].



Our data are preliminary but promising: the APM has proved useful in confirming available data showing that F0 and voice intensity increase as teaching hours pass in teachers with a negative history of voice disorders, laryngeal dysfunction or vocal fold pathology. Some authors have demonstrated that fundamental frequency (F0) and intensity (dB SPL) in teachers gradually increase from the first hour of lessons to the fifth, whether electric amplifiers are used or not [16-18], possibly in an attempt to keep the attention of the students as the hours pass and also in an attempt to overcome voice fatigue. Some authors have reported an increase in fundamental frequency (F0) and sound pressure level (SPL) after long periods of voice usage, whether professional or not [19, 20]. In fact, after a substantial vocal load, voice changes characteristic of hypertonia of the laryngeal structures have been described (especially in females), though episodes of post-stress vocal hypotonia have also been reported [21]. Furthermore, the APM demonstrated that in the teachers with vocal cord nodules there was a progressive drop in intensity and in F0 during teaching, while vocal load and phonation time did not vary significantly. Though these data must be evaluated further in larger groups of subjects, they already suggest that periods of rest (or temporary interruptions) should be considered for teachers who have vocal fold disorders, and that electric voice amplifiers should be used during lessons. The APM data may be useful in the future for establishing safe limits for the vibration dose and, consequently, for recommending phonation times that protect teachers from the risk of further damage to their voice.